Learning Objectives
After completing this tutorial you should be able to
- Define what scientific misconduct is and describe common unethical practices that may or may not fall into that strict definition.
- Define what the ‘reproducibility crisis’ is, common practices that are contributing, and understand how some fields/methodologies are more susceptible.
- Distinguish between different definitions for reproducibility and replicability, and explain how they contribute to the confidence in scientific results being “correct” by the scientific community and public.
Download the directory for this project here, make sure the directory is unzipped and move it to your bi328 directory. You can open the Rproj for this module either by double clicking on it which will launch Rstudio or by opening Rstudio and then using File > Open Project or by clicking on the Rproject icon in the top right of your program window and selecting Open Project.
There should be a file named 20_reproducibility-crisis.qmd in that project directory. Use that file to work through this tutorial - you will hand in your rendered (“knitted”) quarto file as your homework assignment. So, first thing in the YAML header, change the author to your name. You will use this quarto document to record your answers. Remember to use comments to annotate your code; at minimum you should have one comment per code set you may of course add as many comments as you need to be able to recall what you did. Similarly, take notes in the document as we discuss discussion/reflection questions but make sure that you go back and clean them up for “public consumption”.
1 You should do this whether you are adding code yourself or using code from our manual, even if it isn’t commented in the manual… especially when the code is already included for you, add comments to describe how the function works/what it does as we introduce it during the participatory coding session so you can refer back to it.
Frabrication, falsification, and plagiarism (oh my!)
The National Science Foundation defines scientific misconduct in the categories of fabrication, falsification, and plagiarism. Give a brief definition of each term.
Take the set of descriptions that describe hypothetical scenarios of decision making during data generation and analysis. Classify each as either scientific misconduct according to NSF’s definition or not.
Identify scenarios that you are not sure of which category they fall into and be ready to discuss with the class.
2 hypothetical because not referring to a specific case study but these are very realistic scenarios of daily decision making during research.
[Snap a picture, put it in your images folder and then insert it here].
Take your scenarios and rank them along a continuum of ethical to unethical.
Consider these aspects to establish your ranking:
- are some unethical practices worse than others?
- which scenarios do you think are more common than others?
- do you think some are easy to get away with?
- how easy do you think it is to detect if something like this has taken place?
- whose responsibility is it to ensure unethical conduct does not take place during the research process?
After discussion with the class identify at least five major categories of misconduct and unethical behavior and give an example for each. Briefly discuss why for some categories identifying misconduct and unethical conduct is more clear cut while for others it can be difficult to draw a definitive line.
P-hacking and data dredging
Imagine you are a social scientist interested in how political parties impact the US economy.
First, develop a hypothesis of whether Democrats or Republicans being in office positively or negatively impacts the US economy.
Now, use real data going back to 1948 to investigate. To publish your data you would need a statistically significant result. Fortunately you can hack your way to scientific glory using fivethirtyeight’s interactive tool. Describe how you were able to confirm your hypothesis by manipulating which group of politicians to include, how you measured economic performance and other options.
Finally, formulate a second opposing theory and see if you can generate a statistically significant result for that.
3 Pro Tip: Find a p-hacking buddy and test alternate hypotheses and then swap your results!
Congratulations, you just became a successful p-hacker. The practices of p-hacking and data dredging have become increasing common in the era of big data.
Briefly describe what the practices of p-hacking and data-dredging entail.
Briefly describe what the reproducibility crisis is and argue which fields of science you would expect to be more/less heavily impacted and how the increasing availability of large data sets and deployment of complex methods of analysis (including machine learning) have contributed.
You keep using that word, but …
Replicability and reproducibility of studies both generally refer to the practice of validating the results obtained by duplicating them. However, exact definitions of the terms vary among fields of research. Briefly, argue how you would rank different levels of confidence in the results of a study based on whether it was been repeated with the same results using (combinations of) the same or different teams, the same or different experimental set-ups, and/or the same or different data set.
The National Science Foundation (NSF) defines “replicability” as “the ability of a researcher to duplicate the results of a prior study if the same procedures are followed but new data are collected”.
(Goodman, Fanelli, and Ioannidis 2016) propose a framework that defines three categories based on the goals as related to transparency & compete reporting of methods, producing new evidence and drawing the same conclusion. Briefly compare and contrast the categories of methods reproducibility, results reproducibility, and inferential reproducibility.
Briefly discuss how (lack of) reproducibility can undermine confidence in the scientific process from the general public and/or allow special interest groups to manipulate information to intentionally sow distrust.